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RAW SEQUENCE LISTING DATE: 07/05/20(yT* *^ ' \. 

PATENT APPLICATION: US/09/763 , 712A TIME: 10:42:51 

Input Set : A:\37157SEQ.txt OC 1 1 ^ 

' " " CENTEB 160012900 



Output Set: N:\CRF3\07052001\I7637I2A.raw 

3 <110> APPLICANT: Wakaitiiya, N. 



TECH 



5 <120> TITLE OF INVENTION: Novel Collectin 

7 <130> FILE REFERENCE: 19036/37157 
C--> 9 <140> CURRENT APPLICATION NUMBER: US/09/7 63 , 712A 
C--> 9 <141> CURRENT FILING DATE: 2001-05-04 

9 <150> PRIOR APPLICATION NUMBER: JP HEI 10-237611 
10 <151> PRIOR FILING DATE: 1998-08-24 
12 <160> NUMBER OF SEQ ID NOS : 32 

14 <210> SEQ ID NO: 1 

15 <211> LENGTH: 2024 

16 <212> TYPE: DNA 

17 <213> ORGANISM: Homo Sapiens 
19 <220> FEATURE: 

CDS 

(670) . . (1695) 
1 



20 <221> NAME/KEY 

21 <222> LOCATION 



23 <400> SEQUENCE 

24 gtcacgaatc tgcagcaaga taccagcgtg ctccagggca atctgcagaa ccaaatgtat 60 

2 5 tctcataatg tggtcatcat gaacctcaac aacctgaacc tgacccaggt gcagcagagg 120 

26 aacctcatca cgaatctgca gcggtctgtg gatgacacaa gccaggctat ccagcgaatc 180 

27 aagaacgact ttcaaaatct gcagcaggtt tttcttcaag ccaagaagga cacggattgg 240 

28 ctgaaggaga aagtgcagag cttgcagacg ctggctgcca acaactctgc gttggccaaa 300 

29 gccaacaacg acaccctgga ggatatgaac agccagctca actcattcac aggtcagatg 360 

3 0 gagaacatca ccactatctc tcaagccaac gagcagaacc tgaaagacct gcaggactta 420 

31 cacaaagatg cagagaatag aacagccatc aagttcaacc aactggagga acgcttccag 480 

32 ctctttgaga cggatattgt gaacatcatt agcaatatca gttacacagc ccaccacctg 540 

33 cggacgctga ccagcaatct aaatgaagtc aggaccactt gcacagatac ccttaccaaa 600 

34 cacacagatg atctgacctc cttgaataat accctggcca acatccgttt ggattctgtt 660 
3 5 tctctcagg atg caa caa gat ttg atg agg teg agg tta gac act gaa gta 711 

36 Met Gin Gin Asp Leu Met Arg Ser Arg Leu Asp Thr Glu Val 

37 1 5 10 

38 gcc aac tta tea gtg att atg gaa gaa atg aag eta gta gac tec aag 759 

3 9 Ala Asn Leu Ser Val lie Met Glu Glu Met Lys Leu Val Asp Ser Lys 

40 15 20 25 30 

41 cat ggt eag etc ate aag aat ttt aca ata eta caa ggt cea ceg gge 807 

42 His Gly Gin Leu lie Lys Asn Phe Thr lie Leu Gin Gly Pro Pro Gly 

43 35 40 45 

44 cec agg ggt cea aga ggt gac aga gga tec eag gga ccc ect gge cea 855 

4 5 Pro Arg Gly Pro Arg Gly Asp Arg Gly Ser Gin Gly Pro Pro Gly Pro 

46 50 55 60 

47 act gge aac aag gga eag aaa gga gag aag ggg gag cot gga cea cet 903 
4 8 Thr Gly Asn Lys Gly Gin Lys Gly Glu Lys Gly Glu Pro Gly Pro Pro 

49 65 70 75 

50 gge cet gcg ggt gag aga gge cea att gga cea get ggt ccc ccc gga 951 

51 Gly Pro Ala Gly Glu Arg Gly Pro lie Gly Pro Ala Gly Pro Pro Gly 

52 80 85 90 

53 gag egt gge gge aaa gga tct aaa gge tec eag gge ccc aaa gge tec 999 

54 Glu Arg Gly Gly Lys Gly Ser Lys Gly Ser Gin Gly Pro Lys Gly Ser 
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RAW SEQUENCE LISTING DATE: 07/05/200:L^ — ^ — p. 

PATENT APPLICATION: US/09/763 , 712A TIME: 10 : 42 : 51 H t 1 1 V t L) 

Input Set : A:\37157SEQ.txt gpj ^ ^ 2001 

Output Set: N:\CRF3\07052001\I763712A.raw 



55 


95 










100 










105 










110 


TECH 


56 


cgt 


ggt 


tec 


cct 


ggg 


aag 


ccc 


ggc 


cct 


cag 


ggc 


ccc 


agt 


ggg 


gac 


eca 


1047 


57 


Arg 


Gly 


Ser 


Pro 


Gly 


Lys 


Pro 


Gly 


Pro 


Gin 


Gly 


Pro 


Ser 


Gly 


Asp 


Pro 




58 










115 










120 










125 






59 


ggc 


ccc 


ccg 


ggc 


eca 


eca 


ggc 


aaa 


gag 


gga 


etc 


ccc 


ggc 


cct 


cag 


ggc 


1095 


60 


Gly 


Pro 


Pro 


Gly 


Pro 


Pro 


Gly 


Lys 


Glu 


Gly 


Leu 


Pro 


Gly 


Pro 


Gin 


Gly 




61 








130 










135 










140 








62 


cct 


cct 


ggc 


ttc 


cag 


gga 


ett 


cag 


ggc 


ace 


gtt 


ggg 


gag 


cct 


ggg 


gtg 


1143 


63 


Pro 


Pro 


Gly 


Phe 


Gin 


Gly 


Leu 


Gin 


Gly 


Thr 


Val 


Gly 


Glu 


Pro 


Gly Val 




64 






145 










150 










155 










65 


cct 


gga 


cct 


egg 


gga 


ctg 


eca 


ggc 


ttg 


cct 


ggg 


gta 


eca 


ggc 


atg 


eca 


1191 


66 


Pro 


Gly 


Pro 


Arg 


Gly 


Leu 


Pro 


Gly 


Leu 


Pro 


Gly 


Val 


Pro 


Gly 


Met 


Pro 




67 




160 










165 










170 












68 


ggc 


ccc 


aag 


ggc 


ccc 


ccc 


ggc 


cct 


cct 


ggc 


eca 


tea 


gga 


geg 


gtg 


gtg 


1239 


69 


Gly 


Pro 


Lys 


Gly 


Pro 


Pro 


Gly 


Pro, 


, Pro 


Gly 


Pro 


Ser 


Gly 


Ala 


Val 


Val 




70 


175 










180 










185 










190 




71 


ccc 


ctg 


gcc 


ctg 


cag 


aat 


gag 


eca 


ace 


ccg 


gea 


ccg 


gag 


gac 


aat 


ggc 


1287 


72 


Pro 


Leu 


Ala 


Leu 


Gin 


Asn 


Glu 


Pro 


Thr 


Pro 


Ala 


Pro 


Glu 


Asp 


Asn 


Gly 




73 










195 










200 










205 






74 


tgc 


ccg 


cct 


cac 


tgg 


aag 


aac 


ttc 


aca 


gac 


aaa 


tgc 


tac 


tat 


ttt 


tea 


1335 


75 


Cys 


Pro 


Pro 


His 


Trp 


Lys 


Asn 


Phe 


Thr 


Asp 


Lys 


Cys 


Tyr 


Tyr 


Phe 


Ser 




76 








210 










215 










220 








77 


gtt 


gag 


aaa 


gaa 


att 


ttt 


gag 


gat 


gea 


aag 


ett 


ttc 


tgt 


gaa 


gac 


aag 


1383 


78 


Val 


Glu 


Lys 


Glu 


He 


Phe 


Glu 


Asp 


Ala 


Lys 


Leu 


Phe 


Cys 


Glu 


Asp 


Lys 




79 






225 










230 










235 










80 


tct 


tea 


cat 


ett 


gtt 


ttc 


ata 


aac 


act 

i 


aga 


gag 


gaa 


cag 


caa 


tgg 


ata 


1431 


81 


Ser 


Ser 


His 


Leu 


Val 


Phe 


He 


Asn 


Thr 


Arg 


Glu 


Glu 


Gin 


Gin 


Trp 


He 




82 




240 










245 










250 












83 


aaa 


aaa 


cag 


atg 


gta 


ggg 


aga 


gag 


age 


cac 


tgg 


ate 


ggc 


etc 


aca 


gac 


1479 


84 


Lys 


Lys 


Gin 


Met 


Val 


Gly 


Arg 


Glu 


Ser 


His 


Trp 


He 


Gly 


Leu 


Thr 


Asp 




85 


255 










260 










265 










270 




86 


tea 


gag 


cgt 


gaa 


aat 


gaa 


tgg 


aag 


tgg 


ctg 


gat 


ggg 


aca 


tct 


eca 


gac 


1527 


87 


Ser 


Glu 


Arg 


Glu 


Asn 


Glu 


Trp 


Lys 


Trp 


Leu 


Asp 


Gly 


Thr 


Ser 


Pro 


Asp 




88 










275 










280 










285 






89 


tac 


aaa 


aat 


tgg 


aaa 


get 


gga 


cag 


ccg 


gat 


aac 


tgg 


ggt 


eat 


ggc 


cat 


1575 


90 


Tyr 


Lys 


Asn 


Trp 


Lys 


Ala 


Gly 


Gin 


Pro 


Asp 


Asn 


Trp 


Gly 


His 


Gly 


His 




91 








290 










295 










300 








92 


ggg 


oca 


gga 


gaa 


gac 


tgt 


get 


ggg 


ttg 


att 


tat 


get 


ggg 


cag 


tgg 


aac 


1623 


93 


Gly 


Pro 


Gly 


Glu 


Asp 


Cys 


Ala 


Gly 


Leu 


He 


Tyr 


Ala 


Gly 


Gin 


Trp 


Asn 




94 






305 










310 










315 










95 


gat 


ttc 


caa 


tgt 


gaa 


gac 


gtc 


aat 


aac 


ttc 


att 


tgc 


gaa 


aaa 


gac 


agg 


1671 


96 


Asp 


Phe 


Gin 


Cys 


Glu 


Asp 


Val 


Asn 


Asn 


Phe 


He 


Cys 


Glu 


Lys 


Asp 


Arg 




97 




320 










325 










330 












98 


gag 


aca 


gta 


ctg 


tea 


tct 


gea 


tta 


taaeggactg tgatgggatc acatgagcaa 


1725 


99 


Glu 


Thr 


Val 


Leu 


Ser 


Ser 


Ala 


Leu 




















100 


335 








340 






















101 


attttcagct 


cteaaaggea aaggacaetc ctttetaatt 


. geatcacett 


etcatcagat 


1785 


102 


tgaaaaaaaa 


aaaagcaetg aaaaccaatt actgaaaaaa 


. aattgacage 


tagtgttttt 


1845 


103 


taccatccgt 


cattacecaa agacttggga actaaaatgt 


. tceccagggt 


gatatgctga 


1905 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/763 , 712A 



DATE: 07/05/2001 
TIME: 10:42:51 



Input Set : A:\37I57SEQ, txt 

Output Set: N:\CRF3\07052001\I763712A.raw 



104 ttttcattgt gcacatggac tgaatcacat agattctcct ccgtcagtaa ccgtgcgatt 1965 

105 atacaaatta tgtcttccaa agtatggaac actccaatca gaaaaaggtt atcatcccg 2024 

107 <210> SEQ ID NO: 2 

108 <211> LENGTH: 547 

109 <212> TYPE: PRT 

110 <213> ORGANISM: Homo Sapiens 

112 <220> FEATURE: 

113 <223> OTHER INFORMATION: Deduced Amino Acid Sequence of Novel Collectin from 



116 <400> SEQUENCE: 2 

117 Met Tyr Ser His Asn Val Val lie Met Asn Leu Asn Asn Leu Asn Leu 

118 1 5 10 15 

119 Thr Gin Val Gin Gin Arg Asn Leu lie Thr Asn Leu Gin Arg Ser Val 

120 20 25 30 

121 Asp Asp Thr Ser Gin Ala lie Gin Arg lie Lys Asn Asp Phe Gin Asn 

122 35 40 45 

123 Leu Gin Gin Val Phe Leu Gin Ala Lys Lys Asp Thr Asp Trp Leu Lys 

124 50 55 60 

12 5 Glu Lys Val Gin Ser Leu Gin Thr Leu Ala Ala Asn Asn Ser Ala Leu 

126 65 70 75 80 

127 Ala Lys Ala Asn Asn Asp Thr Leu Glu Asp Met Asn Ser Gin Leu Asn 

128 85 90 95 

129 Ser Phe Thr Gly Gin Met Glu Asn lie Thr Thr lie Ser Gin Ala Asn 

130 100 105 110 

131 Glu Gin Asn Leu Lys Asp Leu Gin Asp Leu His Lys Asp Ala Glu Asn 

132 115 120 125 

133 Arg Thr Ala lie Lys Phe Asn Gin Leu Glu Glu Arg Phe Gin Leu Phe 

134 130 135 140 

135 Glu Thr Asp lie Val Asn lie lie Ser Asn lie Ser Tyr Thr Ala His 

136 145 150 155 160 

137 His Leu Arg Thr Leu Thr Ser Asn Leu Asn Glu Val Arg Thr Thr Cys 

138 165 170 175 

13 9 Thr Asp Thr Leu Thr Lys His Thr Asp Asp Leu Thr Ser Leu Asn Asn 

140 180 185 190 

141 Thr Leu Ala Asn lie Arg Leu Asp Ser Val Ser Leu Arg Met Gin Gin 

142 195 200 205 

143 Asp Leu Met Arg Ser Arg Leu Asp Thr Glu Val Ala Asn Leu Ser Val 

144 210 215 220 

145 lie Met Glu Glu Met Lys Leu Val Asp Ser Lys His Gly Gin Leu lie 

146 225 230 235 240 

147 Lys Asn Phe Thr lie Leu Gin Gly Pro Pro Gly Pro Arg Gly Pro Arg 

148 245 250 ' 255 

149 Gly Asp Arg Gly Ser Gin Gly Pro Pro Gly Pro Thr Gly Asn Lys Gly 

150 260 265 270 

151 Gin Lys Gly Glu Lys Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Glu 

152 275 280 285 

153 Arg Gly Pro lie Gly Pro Ala Gly Pro Pro Gly Glu Arg Gly Gly Lys 

154 290 295 300 

155 Gly Ser Lys Gly Ser Gin Gly Pro Lys Gly Ser Arg Gly Ser Pro Gly 



Nucleotide 
114 



Sequence 
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RAW SEQUENCE LISTING DATE: 07/05/2001 

PATENT APPLICATION: US/09/763 , 7 12A TIME: 10:42:51 



Input Set : 
Output Set: 

156 305 310 

157 Lys Pro Gly Pro Gin Gly 

158 325 

159 Pro Gly Lys Glu Gly Leu 

160 340 

161 Gly Leu Gin Gly Thr Val 

162 355 

163 Leu Pro Gly Leu Pro Gly 

164 370 

165 Pro Gly Pro Pro Gly Pro 

166 385 390 

167 Asn Glu Pro Thr Pro Ala 

168 405 

169 Lys Asn Phe Thr Asp Lys 

170 420 

171 Phe Glu Asp Ala Lys Leu 

172 435 

173 Phe lie Asn Thr Arg Glu 

174 450 

175 Gly Arg Glu Ser His Trp 

176 465 470 

177 Glu Trp Lys Trp Leu Asp 

178 485 

179 Ala Gly Gin Pro Asp Asn 

180 500 

181 Cys Ala Gly Leu lie Tyr 

182 515 

183 Asp Val Asn Asn Phe lie 

184 530 

185 Ser Ala Leu 

186 545 

188 <210> SEQ ID NO: 3 

189 <211> LENGTH: 27 

190 <212> TYPE: PRT 

191 <213> ORGANISM: Artificial Sequence 

193 <220> FEATURE: 

194 <223> OTHER INFORMATION: Modified Consensus Sequence of collectins Hybridizable with 

Novel 

195 Collectin 

197 <400> SEQUENCE: 3 

198 Glu Lys Cys Val Glu Met Tyr Thr Asp Gly Lys Trp Asn Asp Arg Asn 

199 15 10 15 

200 Cys Leu Gin Ser Arg Leu Ala lie Cys Glu Phe 

201 20 25 

203 <210> SEQ ID NO: 4 

204 <211> LENGTH: 21 

205 <212> TYPE: DNA 

206 <213> ORGANISM: Artificial Sequence 

208 <220> FEATURE: 

209 <223> OTHER INFORMATION: Sequence of a Reverse Primer for Screening a Novel 
Collectin . 



A:\37157SEQ.txt 
N:\CRF3\07052001\I7i 

315 

Pro Ser Gly Asp Pro 
330 

Pro Gly Pro Gin Gly 
345 

Gly Glu Pro Gly Val 
360 

Val Pro Gly Met Pro 
375 

Ser Gly Ala Val Val 
395 

Pro Glu Asp Asn Gly 
410 

Cys Tyr Tyr Phe Ser 
425 

Phe Cys Glu Asp Lys 
440 

Glu Gin Gin Trp lie 
455 

lie Gly Leu Thr Asp 
475 

Gly Thr Ser Pro Asp 
490 

Trp Gly His Gly His 
505 

Ala Gly Gin Trp Asn 
520 

Cys Glu Lys Asp Arg 
535 



3712A.raw 

320 



Gly 


Pro 


Pro 


Gly 
335 


Pro 


Pro 


Pro 


Gly 
350 


Phe 


Gin 


Pro 


Gly 
365 


Pro 


Arg 


Gly 


Gly 


Pro 


Lys 


Gly 


Pro 


380 










Pro 


Leu 


Ala 


Leu 


Gin 
400 


Cys 


Pro 


Pro 


His 
415 


Trp 


Val 


Glu 


Lys 
430 


Glu 


He 


Ser 


Ser 
445 


His 


Leu 


Val 


Lys 


Lys 


Gin 


Met 


Val 


460 










Ser 


Glu 


Arg 


Glu 


Asn 
480 


Tyr 


Lys 


Asn 


Trp 
495 


Lys 


Gly 


Pro 


Gly 
510 


Glu 


Asp 


Asp 


Phe 
525 


Gin 


Cys 


Glu 


Glu 


Thr 


Val 


Leu 


Ser 



540 
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RAW SEQUENCE LISTING DATE: 07/05/2001 

PATENT APPLICATION: US/09/7 63 , 7 12A TIME: 10:42:51 

Input Set : A:\37157SEQ.txt 

Output Set: N:\CRF3\07052001\I763712A.raw 

211 <400> SEQUENCE: 4 

212 caatctgatg agaaggtgat g 21 

214 <210> SEQ ID NO: 5 

215 <211> LENGTH: 21 

216 <212> TYPE: DNA 

217 <213> ORGANISM: Artificial Sequence 

219 <2 20> FEATURE: 

220 <223> OTHER. INFORMATION: Sequence of a Forward Primer for Screening a Novel 
Collectin. 

222 <400> SEQUENCE: 5 

223 acgaggggct ggatgggaca t 21 

225 <210> SEQ ID NO: 6 

226 <211> LENGTH: 27 

227 <212> TYPE: PRT 

228 <213> ORGANISM: Artificial Sequence 

230 <220> FEATURE: 

231 <223> OTHER INFORMATION: Consensus sequence of three collectins which were reported 

232 heretofore 

234 <400> SEQUENCE: 6 

235 Glu Asp Cys Val Leu Leu Leu Lys Asn Gly Gin Trp Asn Asp Val Pro 

236 1 5 10 15 

237 Cys Ser Thr Ser His Leu Ala Val Cys Glu Phe 

238 20 25 

240 <210> SEQ ID NO: 7 

241 <211> LENGTH: 24 

242 <212> TYPE: DNA 

243 <213> ORGANISM: Artificial Sequence 
24 5 <220> FEATURE: 

246 <223> OTHER INFORMATION: M13 Universal Primer Sequence for Sequencing 
248 <400> SEQUENCE: 7 

24 9 cgacgttgta aaacgacggc cagt 24 

251 <210> SEQ ID NO: 8 

252 <211> LENGTH: 17 

253 <212> TYPE: DNA 

254 <213> ORGANISM: Artificial Sequence 

256 <220> FEATURE: 

257 <223> OTHER INFORMATION: M13 Reverse Primer Sequence for Sequencing. 

259 <400> SEQUENCE: 8 

260 caggaaaca gctatgac 17 

262 <210> SEQ ID NO: 9 

263 <211> LENGTH: 24 

264 <212> TYPE: DNA 

265 <213> ORGANISM: Artificial Sequence 

267 <220> FEATURE: 

268 <223> OTHER INFORMATION: Sequence of a lambda gtll Reverse Primer for Sequencing. 

270 <400> SEQUENCE: 9 

271 ttgacaccag accaactggt aatg 24 

273 <210> SEQ ID NO: 10 

274 <211> LENGTH: 24 

275 <212> TYPE: DNA 
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VERIFICATION SUMMARY DATE: 07/05/2001 

PATENT APPLICATION: US/09/763 , 712A TIME: 10:42:52 

Input Set : A:\37157SEQ.txt 

Output Set: N:\CRF3\07052001\I763712A.raw 

L:9 M:270 C: Current Application Number differs, Replaced Current Application No 
L:9 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
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